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Abstract 

We study several statistical mechanical models on a general tree. Particular at- 
tention is devoted to the classical Heisenberg models, where the state space is the 
d-dimensional unit sphere and the interactions are proportional to the cosines of the 
angles between neighboring spins. The phenomenon of interest here is the classifica- 
tion of phase transition (non-uniqueness of the Gibbs state) according to whether it is 
robust. In many cases, including all of the Heisenberg and Potts models, occurrence of 
robust phase transition is determined by the geometry (branching number) of the tree 
in a way that parallels the situation with independent percolation and usual phase 
transition for the Ising model. The critical values for robust phase transition for the 
Heisenberg and Potts models are also calculated exactly. In some cases, such as the 
g > 3 Potts model, robust phase transition and usual phase transition do not coin- 
cide, while in other cases, such as the Heisenberg models, we conjecture that robust 
phase transition and usual phase transition are equivalent. In addition, we show that 
symmetry breaking is equivalent to the existence of a phase transition, a fact believed 
but not known for the rotor model on Z^. 
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1 Definition of the model and main results 



Particle systems on trees have produced the first and most tractable examples of certain 
qualitative phenomena. For example, the contact process on a tree has multiple phase 
transitions, ([19, 12, 22]) and the critical temperature for the Ising model on a tree is 
determined by its branching number or Hausdorff dimension ([13, 8, 20]), which makes 
the Ising model intimately related to independent percolation whose critical value is also 
determined by the branching number (see [14]). In this paper we study several models 
on general infinite trees, including the classical Heisenberg and Potts models. Our aim is 
to exhibit a distinction between two kinds of phase transitions, robust and non-robust, as 
well as to investigate conditions under which robust phase transitions occur. 

In many cases, including the Heisenberg and Potts models, the existence of a robust 
phase transition is determined by the branching number. However, in some cases (in- 
cluding the q > 2 Potts model), the critical temperature for the existence of usual phase 
transition is not determined by the branching number. Thus robust phase transition be- 
haves in a more universal manner than non-robust phase transition, being a function of 
the branching number alone, as it is for usual phase transition for independent percola- 
tion and the Ising model. Although particle systems on trees do not always predict the 
qualitative behavior of the same particle system on high-dimensional lattices, it seems 
likely that there is a lattice analogue of non-robust phase transition, which would make 
an interesting topic for further research. Another unresolved question is whether there is 
ever a non-robust phase transition for the Heisenberg models (see Conjecture 1.9). 

We proceed to define the general statistical ensemble on a tree and to state the main 
results of the paper. Let G be a compact metrizable group acting transitively by isometries 
on a compact metric space (S, d). It is well known that there exists a unique G— invariant 
probability measure on S, which we denote by c/x. An energy function is any noncon- 
stant function : S x S — > IR that is symmetric, continuous, and d-invariant in that 
H{x,y) depends only on d{x,y). This implies that 

H{x, y) = H{gx, gy) Vx, y € S, 5 G G. 

S together with its G-action and the function H will be called a statistical ensemble. 
Several examples with which we will be concerned are as follows. 
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Example 1 The Ising model. Here S = {1,-1} acted on by itself (multiplicatively), d is 
the usual discrete metric, dx is uniform on S, and H[x, y) = —xy. 

Example 2 The Potts model. Here S = {0, 1, . . . , g — 1} for some integer q > I, G is the 
symmetric group Sq with its natural action, d is the usual discrete metric, dx is uniform 
on S, and H{x,y) = 1 — 25x,y This reduces to the Ising model when q = 2. 

Example 3 The rotor model. Here S is the unit circle, acted on by itself by translations, 
d{9, (p) = 1 — cos{9 — (p), dx is normalized Lebesgue measure, and H{6, (p) = — cos{9 — (p). 

Example 4 The Heisenberg models for d>l. In the d-dimensional Heisenberg model, S 
is the unit sphere S'^, G is the special orthogonal group with its natural action; d{x, y) is 
1 — X ■ y, dx is normalized surface measure, and H{x, y) is again the negative of the dot 
product of X and y. When d = \, we recover the rotor model. 

Let A be any finite grapli, witli vertex and edge sets denoted by ^(^4) and E{A) 
respectively, and let J : E{A) be a function mapping tire edge set of A to the 

nonnegative reals wliich we call interaction strengths. We now assume that S, G and 
H are given and fixed. 

Definition 1 The Gibbs measure with interaction strengths J is the probability mea- 
sure \i = on S^^"^) whose density with respect to product measure IS given 
by 

exp(-JJ-^(;/)) ^ ^gSy(A) 

z 

where 

H^iv)= E Jie)H{nix),viy)), 

e=xyeE{A) 

and Z = J exTp{—H'^(r]))dx^^'^^ is a normalization. 

In statistical mechanics, one wants to define Gibbs measures on infinite graphs A 
in which case the above definition of course does not make sense. We follow the usual 
approach (see [10]), in which one introduces boundary conditions and takes a weak limit of 
finite subgraphs increasing to A. Since the precise nature of the boundary conditions play 
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a role here (we know this to be true at least for the Potts model with g > 2), we handle 
boundary conditions with extra care and, unfortunately, notation. We give definitions 
in the case of a rooted tree, though the extensions to general locally finite graphs are 
immediate. By a tree, we mean any connected loopless graph T where every vertex has 
finite degree. One fixes a vertex o of T which we call the root, obtaining a rooted tree. 
The vertex set of T is denoted by V^(r). If x is a vertex, we write \x\ for the number of 
edges on the shortest path from o to x and for two vertices x and y, we write — y| for 
the number of edges on the shortest path from x to y. For vertices x and y, we write x <y 
if X is on the shortest path from o to y, x < y ii x < y and x ^ y, and x ^ y ii x < y and 
\y\ = \x\ + 1. For x G V{r), the tree r(x) denotes the subtree of F rooted at x consisting 
of X and all of its descendents. We also define dF, which we refer to as the boundary of 
F, to be the set of infinite self-avoiding paths starting from o. Throughout the paper, the 
following assumption is in force. 

ASSUMPTION: For all trees considered in this paper, the number of children of the 
vertices will be assumed bounded and we will denote this bound by B. 

A cutset C is a finite set of vertices not including o such that every self-avoiding 
infinite path from o intersects C and such that there is no pair x,y G C with x < y. Given 
a cutset C, F\C has one finite component (which contains o) which we denote by C* ( "i" 
for inside) and we let C° ( "o" for outside) denote the union of the infinite components of 
F\C. We say that a sequence {C„} of cutsets approaches oo if for all v e T, v E for 
all sufficiently large n. 

Boundary conditions will take the form of specifications of the value of 77 at some cutset 
C. Let 5 be any element of S'^. The Gibbs measure with boundary condition 5 is the 
probability measure = ^q'^ on S*^" whose density with respect to product measure 
is given by 



r?GS^ (1.1) 



where 



H^'\v)= E J{e)H{rj{x)My))+ E J{e)Hiv{x),Siy)) 



e=xy&E(T) e=xyeE(r) 



and Z = /exp(-ijg'''^(r/))dx^' is a normalization. When we don't include the second 
summand above, we call this the free Gibbs measure on C*, denoted by fi^^^, where J 
is suppressed in the notation. As we will see in Lemma 1.1, the free measure does not 
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depend on C except for its domain of definition, so we can later also suppress C in the 
notation. 

Definition 2 A probability measure fi on S^^^^ is called a Gibbs state for the interac- 
tions J if for each cutset C , the conditional distribution on given the configuration 5' 
on C \J C° is given by fi^' where S is the restriction of 6' to C. (A similar definition 
is used for general graphs.) Both in the case of lattices and trees (or for any graph), we 
say that a statistical ensemble exhibits a phase transition (PT) for the interaction 
strengths J' if there is more than one Gibbs state for the interaction strengths J . 

In the next section we will prove 

Lemma 1.1 Fix interaction strengths J and let C and D be any two cutsets ofT. Then 
the projections of i/q^"^ and /Jf^^ to S'^''^^* are equal. Hence the measures ^Jq^ have a 
weak limit as C ^ oo, denoted /J^^^. 

For general graphs, the measures /x^^^ are not compatible in this way. Also, one has 
the following fact, which follows from Theorems 4.17 and 7.12 in [10]. 

Lemma 1.2 If {Cn} is a sequence of cutsets approaching oo and if for each n, 5n £ S*^", 
then any weak subsequential limit of the sequence {A*(7^"}n>i is a Gibbs state for the 
interactions J . In addition, if all such possible limits are the same, then there is no phase 
transition. (A similar statement holds for graphs other than trees.) 

We pause for a few remarks about more general graphs, before restricting our discussion 
to trees for the rest of the paper. Lemma 1.1 does not apply to graphs with cycles, so the 
existence of a unique weak limit fi^^^^ is not guaranteed there, but Lemma 1.2 together with 
compactness tells us that there always is at least one Gibbs state. The state of knowledge 
about the rotor model (Example 3) on more general graphs is somewhat interesting. It 
is known (see [10], p. 178 and p.434) that for Z"^, d <2, all Gibbs states are rotationally 
invariant when J7 = J for any J (and it is believed but not known that there is a unique 
Gibbs state for the rotor model in this case) while for d > 3, there are values of J for 
which the rotor model with J = J has a Gibbs state whose distribution at the origin is 
not rotationally invariant (and hence there is more than one Gibbs state). In statistical 
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mechanics, this latter plicnomenon is referred to as a continuous symmetry breaking since 
we have a continuous state space (the circle) where the interactions are invariant under a 
certain continuous symmetry (rotations) but there are Gibbs states which are not invariant 
under this symmetry. We also mention that it is proved in [6] that for the rotor model 
with J = J for any J on any graph of bounded degree for which simple random walk is 
recurrent, all the Gibbs states are rotationally invariant. (This was then extended in [15] 
where the condition of boundedness of the degree is dropped and the group involved is 
allowed to be more general than the circle.) This however is not a sharp criterion: in [7], 
a graph (in fact a tree) is constructed for which simple random walk is transient but such 
that there is no phase transition in the rotor model when J = J ^ov any J. (This will 
also follow from Theorem 1.10 below together with the easy fact that there are trees with 
branching number 1 for which simple random walk is transient.) However, Y. Peres has 
conjectured a sharp criterion. Conjecture 1.12 below, for which our Corollary 1.11 together 
with the discussion following it provides some corroboration. 

For the rest of this paper, we will restrict to trees. It is usually in this context that 
the most explicit results can be obtained and our basic goal is to determine whether there 
is a phase transition by comparing the interaction strengths with the "size" (branching 
number) of our tree. It turns out that we can only partially answer this question but 
the question which we can answer more completely is whether there is a robust phase 
transition, a concept which we will introduce shortly. 

Definition 3 Given J7, C and 5 defined on C , let f^'^ (or f^g ij J is understood) denote 
the marginal density of at the root a. 

For any tree, recall that T{v) denotes the subtree rooted at v, so that the tree r(u) has 
vertex set {w £ T : v < w}. If € and we intersect C with T(v), we obtain a cutset 
C{v) for r{v). We now extend Definition 3 to other marginals as follows. 

Definition 4 With J', C and 8 as in Definition 3 and v € C*, define f^'^ by replacing T 
by T(v), C with C{v), J with J restricted to E{r{v)), 6 with 6 restricted to C(v) and o 
with V in Definition 3. 

It is important to note that f^^ is not the density of the projection of /xgr' onto 
vertex v, but rather the density of a Gibbs measure with similar boundary conditions on 
the smaller graph T{v). 
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Definition 5 A statistical ensemMe on a tree T exhibits a symmetry breaking (SB) 
for the interactions J' if there exists a Gibbs state such that the marginal distribution 
at some vertex v is not G-invariant (or equivalently is not dx.). 

The following proposition which will be proved in Section 2 is interesting since it 
establishes the equivalence of PT and SB for general trees and general statistical ensembles, 
something not known for general graphs, see the remark below. 

Proposition 1.3 Consider a statistical ensemble on a tree V with interactions J . The 
following four conditions are equivalent. 

(i) There exists a vertex v such that for any sequence of cutsets Cn — ^ oo, there exist 
boundary conditions 5n on Cn such that 

inf||/fe,.-l||oo^O. 

(ii) There exists a vertex v, a sequence of cutsets C„ — > oo and boundary conditions Sn on 
Cn such that 

inf||/fe,.-l||oo^O. 

(Hi) The system satisfies SB. 
(iv) The system satisfies PT. 

We now fix a distinguished element in S, hereafter denoted 0. The notation 
denotes /i^^' when 6 is the constant function 0. In the case ^ = J, we denote this simply 
\1q . We will be particularly concerned about whether \1q — > weakly, as C — > oo. 

Definition 6 A statistical ensemble on a tree T exhibits a symmetry breaking with 
plus boundary conditions (SB+) for the interactions J' if there exists a vertex v 

and a sequence of cutsets Cn ^ oo such that 

inf||/S-l|U^O- 

Note that by symmetry, SB+ does not depend on which point of S is chosen to be 0. 
In Section 4.1 we will prove: 

Proposition 1.4 For the rotor model on a tree, SB is equivalent to SB+. 



6 



We conjecture but cannot prove the stronger statement: 

Conjecture 1.5 For any Heisenberg model on any graph, SB is equivalent to SB+. 

Remarks: (i) By Proposition 1.3, we have that SB+ impHes SB for any statistical ensemble 
on a tree. While Proposition 1.3 tells us that PT and SB are equivalent for any statistical 
ensemble on a tree, we note that such a result is not even known for the rotor model on 
where it has been established that for all J, all Gibbs states are rotationally invariant for 
J = J but where it has not been established that there is no phase transition. A weaker 
form of the above conjecture would be that SB+ and SB are equivalent for all Heisenberg 
models on trees. This is Problem 4.1 in Section 4. An extension to graphs with cycles 
would seem to entail a different kind of reasoning, perhaps similar to the inequalities of 
Monroe and Pearce [16] which fall just short of proving Conjecture 1.5 for the rotor model. 
(m) The fact that PT and SB+ are equivalent when the rotor model is replaced by the Ising 
model is an immediate consequence of the fact that the probability measure is stochasti- 
cally increasing in the boundary conditions. More generally, it is also the case that PT 
and SB-I- are equivalent for the Potts models (see [2]). 

We now consider the idea of a robust phase transition where we investigate if the bound- 
ary conditions on a cutset have a nontrivial effect on the root even when the interactions 
along the cutset are made arbitrarily small but fixed. 

Given parameters J > and J' € (0, J] and a cutset C of F, let J^{ J' , J,C) be the 
function on E{T) which is J on edges in C* and J' on edges connecting C* to C (the values 
elsewhere being irrelevant). Let ''^^'^ denote the marginal at the root o of the measure 

l^c ■— l^c 

Definition 7 The statistical ensemble on the tree F has a robust phase transition 
(RPT) for the parameter J > if for every J' G (0, J] 

inf||/i^'^-l|U7^0 

where the inf is taken over all cutsets C. 

Remarks: In the case JT" = J, by taking J' = J, it is clear that a RPT implies SB-|- (which 
in turn implies SB and PT). Note that in this case, RPT is stronger than SB-|- not only 
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because J' can be any number in (0, J] and the root o must play the role of v but also 
because in SB+, we only require that for some sequence of cutsets going to infinity, the 
marginal at the vertex v stays away from uniform while in RPT, we require this for all 
cutsets going to infinity. We note also that with some care, this definition makes sense for 
general graphs, and that the issue of robustness of phase transition on general graphs is 
worth investigating, although we do not do so here. 

Our first theorem gives criteria based on J and the branching number of T (which 
will now be defined) for robust phase transition to occur for the Heisenberg models. A 
little later on, we will have an analogous result for the Potts models. In [9], Furstenberg 
introduced the notion of the Hausdorff dimension of a tree (or more accurately of the 
boundary of the tree). This was further investigated by Lyons ([14]) using the term 
branching number instead. The branching number of a tree T, denoted 6r(r), is a real 
number greater than or equal to one that measures the average number of branches per 
vertex of the tree. More precisely, the branching number of T is defined by 

br r := inf I A > 0; inf E.ec A"!"! = o| 

where the second infimum is over all cutsets C. The branching number is a measure of the 
average number of branches per vertex of F. It is less than or equal to liminf^^go 
where M„ :=|{a;€r;|x|=n}|, and takes more of the structure of V into account than 
does this latter growth rate. For sufficiently regular trees, such as homogeneous trees 
or, more generally, Galton- Watson trees, hrV = YaHn^tx, Mn^"" ([14]). We also mention 
that the branching number is the exponential of the Hausdorff dimension of dV where 
the latter is endowed with the metric which gives distance e~'' to two paths which split 
off after k steps. As indicated earlier, the branching number has been an important 
quantity in previous investigations. More specifically, in [13] and [14], the critical values 
for independent percolation and for phase transition in the Ising model on general trees 
are explicitly computed in terms of the branching number. 

For each J > 0, define a continuous strictly positive probability density function Kj : 
S ^ ]R+ by 

Kj{u) := C{jy^ exp(- Ji?(u, 6)) (1.2) 

where C(J) = / eyip{—JH{w,0))dx{w) is a normalizing constant, and more generally let 
Kj^y : S — s- R"'" be given by 

Kj,y{u) := C(J)-i eM-JH{u, y)) (1.3) 
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(noting that Kjq = Kj). Let /Cj denote the convolution operator on the space L^(S,dx) 
given by the formula 

/Cj/(n) := / f{x)Kj,,{u)d^{x) . (1.4) 

JS 

Note that by the assumed invariancc Jgexp(—JH{w,y))dx{w) is independent of y and 
that / > and Jg /(.'r)(ix(x) = 1 imply that /Cj/ > and /g /Cj/(x)(ix(x) = 1. We 
extend the above notation to cover the case where / is a pointmass Sy at y by defining in 
that case 

JCj5y{u) := Kj^y{u). (1.5) 

We will now give the exact critical parameter J for RPT for the Heisenberg models. 
For any d > 1, let 

/(J) := 



!\eJ^{l-r'^)i-^dr 

When d = 1 (rotor model), this is (by a change of variables) the first Fourier coefficient 
of Kj (Jg Kj{6) cos{6)d9) which is perhaps more illustrative. When d = 2, this is the first 
Legendre coefficient of eJ^' (properly normalized) and for d > 3, this is the first so-called 
ultraspherical coefficient of e"^^ (properly normalized). 

Theorem 1.6 Letd>l. 

(i) If br{T)p'^{J) < 1, then the d-dimensional Heisenberg model on V with parameter J 
does not exhibit a robust phase transition. 

(ii) If br{r)p'^{J) > 1, then the d-dimensional Heisenberg model on T with parameter J 
exhibits a robust phase transition. 

Remark: It is easy to see that \im.d^oo P^{J) = which says that it is harder to obtain 
a robust phase transition on higher dimensional spheres. This is consistent with the fact 
that it is in some sense harder to have a phase transition for the rotor model than in the 
Ising model (0-dimensional sphere); this latter fact can be established using the ideas in 
[18]. 

A simple computation shows that the derivative of p'^{J) with respect to J is the 
variance of a random variable whose density function is proportional to e'^^{l — r^)*^/^"-*^ 
on [—1, 1], thereby obtaining the following lemma. 

Lemma 1.7 For any d>l, we have that p'^{J) is a strictly increasing function of J. 
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Theorem 1.6 and Lemma 1.7 together with the fact that for any d > 1, p'^{J) is a 
continuous function of J which approaches as J ^ and approaches 1 as J ^ oo give 
us the following corollary. 

Corollary 1.8 For any Heisenberg model with d > 1 and any tree F with branching 
number larger than 1, let Jc = Jc{F,d) be such that br{r)p'^{Jc) = 1. Then there is a 
robust phase transition for the d-dimensional Heisenberg model onT if J > Jc and there 
is no such robust phase transition for J < Jc- 

For the Heisenberg models, we believe that phase transition and robust phase transition 
coincide and therefore we have the following conjecture. 

Conjecture 1.9 For any d > 1, if hr{T)p'^{J) < 1, then the d-dimensional Heisenberg 
model on T with parameter J does not exhibit a phase transition. 

We can however obtain the following weaker form of this conjecture which is valid for 
all statistical ensembles. 

Theorem 1.10 Ifbr{T) = 1, then there is no phase transition for any statistical ensemble 
on r with bounded J . 

Theorems 1.6(ii) and 1.10 together with the facts that RPT implies PT and that for 
any d > 1, \m\.j^^ p^(^J) = 1 immediately yield the following corollary. 

Corollary 1.11 For any Heisenberg model with d > 1 and for any tree T, there is a phase 
transition for the tree T for some value of the parameter J if and only ifbriV) > 1. 

Since it is known (see [14]) that br{T) > 1 if and only if there is some p < 1 with 
the property that when performing independent percolation on F with parameter p, there 
exists a.s. an infinite cluster on which simple random walk is transient, the above corollary 
yields the following conjecture of Y. Peres for the special case of trees of bounded degree. 

Conjecture 1.12 For any graph A, the rotor model exhibits a phase transition for some 
J if and only if there is some p < 1 with the property that performing independent bond 
percolation on A with parameter p, there exists a.s. an infinite cluster on which simple 
random walk is transient. 



10 



Recall that the rotor model on the graph A exhibits no SB for any parameter J if ^ is 
recurrent for simple random walk, which is of course consistent with the above conjecture. 
Note that, on the other hand, the standard Ising model does exhibit a phase transition on 
Z^, a graph which is recurrent (as are its subgraphs) for simple random walk. 

The next result states the critical value for RPT for the Potts models. 
Theorem 1.13 Consider the Potts model with q>2 and let 



(i) If br(T)aj < 1, then the Potts model on T with parameter J does not exhibit a robust 
phase transition. 

(ii) If br{T)aj > 1, then the Potts model on T with parameter J exhibits a robust phase 
transition. 

Remarks: 

(i) daj/dJ > and so there is a critical value of J depending on br{r) analogous to in 
Corollary 1.8 for the Heisenberg models. 

(ii) Note that when q = 2 (the Ising model), this formula agrees with the formula for the 
Heisenberg models when one formally sets d = in the formula 



the latter being obtained by a change of variables. 

To point out the subtlety involved in Conjecture 1.9, we continue to discuss the Potts 
model, a case in which the analogue of Conjecture 1.9 fails. Our final result tells us that 
phase transitions (unlike robust phase transitions) in the Potts model with q > 2 cannot 
be determined by the branching number. 

Theorem 1.14 Given any integer q > 2, there exist trees Ti and T2 and a nontrivial 
interval I such that br{Ti) < br{T2) and for any J G I, there is a phase transition for the 
q-state Potts model with parameter J on Ti but no such phase transition on 



aj = 



e-' + {q- l)e--^ ' 
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Remarks: 

(z) Ti and r2 can each be taken to be spherically symmetric which means that for all k, 
all vertices at the kth generation have the same number of children. 

(a) In the case q = 2, more is known. In [13], the critical value for phase transition in 
the Ising model is found and corresponds to what is obtained in Theorem 1.13 above. It 
follows that there is never a non-robust phase transition except possibly at the critical 
value. However, a sharp capacity criterion exists [20] for phase transition for the Ising 
model (settling the issue of phase transition at the critical parameter) and using this 
criterion, one can show that phase transition and robust phase transition correspond even 
at criticality. The arguments of [20] cannot be extended to the Potts model for g > 2 
because the operator JCj, acting on a certain likelihood function, when conjugated by the 
logarithm is not concave in this case. Theorems 1.13 and 1.14 together tell us that there 
is indeed a non-robust phase transition when g > 2 for a nontrivial interval of J. 

The rest of the paper is devoted to the proofs of the above results. In Section 2, 
we collect several lemmas that apply to general statistical ensembles, including the basic 
recursion formula (Lemma 2.2) that allows us to analyze general statistical ensembles on 
trees, prove Lemma 1.1 and Proposition 1.3 as well as provide some background concerning 
Heiscnbcrg models (showing that they satisfy the more general hypotheses of Theorems 3.1 
and 3.2 given later on) and the more general notion of distance regular spaces. Section 3 
is devoted to the proofs of Theorems 3.1 and 3.2. In Section 4, we use these theorems 
to find the critical parameters for robust phase transition in the Heisenberg and Potts 
models. Theorems 1.6 and 1.13, as well as prove Proposition 1.4. Section 5 discusses the 
special case of trees of branching number 1, proving Theorem 1.10. Finally, in Section 6, 
Theorem 1.14 is proved. 

2 Basic background results 

In this section, we collect various background results which will be needed to prove the 
results described in the introduction. We begin with a subsection describing results per- 
taining to trees that hold for general statistical ensembles. After discussing the concept 
of a distance regular space in Section 2.2, we specialize to Heisenberg models (the most 
relevant family of continuous distance regular models) in Section 2.3 and then to distance 
regular graphs in Section 2.4. 
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2.1 The fundamental recursion and other lemmas 



We start off with two lemmas exploiting the recursive structure of trees. 

Let S, G and if be a statistical ensemble. Let Ai and A2 be two disjoint finite graphs, 
with distinguished vertices vi € ^(^1) and V2 G V{A2). Let Ji and J2 be interaction 
functions for Ai and i-C-, positive functions on E{Ai) and E{A2) respectively. For 
any Ci C \ {vi\ (possibly empty) and any C2 Q V{A2), and for any 5i € S'^i 

and 82 € 8*^^, we have measures /Xj := /i^""^', i = 1,2 on S^^'^'^^'-'* defined (essentially) 
by (1.1). Abbreviate Hfj,' ' (which has the obvious meaning) by Hi. Let A be the union 
of Ai and A2 together with an edge connecting vi and V2- Let C = Ci U C2, J extend 
each Ji and the value of the new edge be given the value J, 5 extend each 5i and denote 
/i^'"^ (a probabihty measure on S^y<yAi)\Ci)^{V{A2)\C2)^ by ^ a,nd H^'^ (again having the 
obvious meaning) by H. The identity 

H = Hi + H2 + JH{ri{vi),ri{v2)) (2.1) 
leads to the following lemma. 



Lemma 2.1 The measure ji satisfies 



d{m X /X2) 



c exp [- Jii (r/i (-ui ) , ry2 (w2 ) )] , 



(2.2) 



where 



c = 



exp{-JH{r)i{vi),ri2iv2)))dfXi{rii)diJ,2{rj2) 



is a normalizing constant. Let fi denote the marginal density of fXi at vi, i = 1,2, and 
f denotes the marginal density of fi at vi. Then the projection fi^^^ of fi onto S^^^'^^^^'^ 
satisfies 

= cj J fii,yfi{y)f2{z) eM-JH{y, z)) d-^{z) d^{y) (2.3) 

for some normalizing constant c, where jii^y denotes the conditional distribution of /xi 
given rj{vi) = y. Consequently, 



f{y) = cfi{y) I f2{z)eM-JH{y,z))d:x{z), 
where c normalizes f to be a probability density. 



(2.4) 
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Proof. The relation (2.2) follows from (2.1) and the defining equation (1.1). From this 
it follows that the measure fi on pairs (??i,r?2) makes rji and r/2 conditionally independent 
given rii{vi) and ^72(^^2)- Hence the conditional distribution of /i^^) given r]i{vi) = y and 
i]2iv2) = z is just fii^y. Next, (2.2) and the last fact yield (2.3). The marginal of //i.-y at 
vi is just 6y, and so (2.3) yields (2.4). □ 

A tree T may be built up from isolated vertices by the joining operation described in 
the previous lemma. The decompositions in Lemma 2.1 may be applied inductively to 
derive a fundamental recursion for marginals. This recursion, Lemma 2.2 below, expresses 
the marginal distribution at the root of F as a pointwise product of marginals at the roots 
of each of the generation 1 subtrees, each convolved with a kernel Kj. The normalized 
pointwise product will be ubiquitous throughout what follows, so we introduce notation 
for it. 



Definition 8 If fi, ■ ■ ■ , fk cifP- nonnegative functions on S with J /j dx = 1 for each i, let 
Q]^{fi, . . . , fk) denote the normalized pointwise product, 

whenever this makes sense, e.g., when each fi is in L^{dyi) and the product is not almost 
everywhere zero. Let Q denote the operator which for each k is on each k-tuple of 
functions. There is an obvious associativity property, namely Q{Q{f,g),h) = Q{f,g,h), 
which may be extended to arbitrarily many arguments. 

Lemma 2.2 (Fundamental recursion) Given a tree T, a cutset C, interactions J, 
boundary condition 5 and v G C, let {wi, . . . ,Wk} be the children of v. Let Ji, . . . ,Jk 
denote the values of J^{v, wi), . . . , J^{v, Wk). Then 

fS = Oi^jJciv ■ ■ -^^jJcL) > (2-5) 

T S 

where when Wi G C, f^'y,. is taken to be the point mass at S{wi) and convention (1-5) is 
in effect. 

Proof. Passing to the subtree T{y), we may assume without loss of generality that v = 0. 
Also assume without loss of generality that wi,...,Wk are numbered so that for some s, 
Wi G C* for i < s and Wi E C fov i > s. For i < s, let C{wi) = C H r{wi). For such i, by 
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definition, /j := /^^, is tlie marginal at Wi of tlic measure fn := ^^(^.-j ^. on configurations 
on T{wi) n C*, where J' and 6 are restricted to £'(r(i(;j)) and C{wi) respectively. Let 
denote the induced subgraph of T whose vertices are the union of {o}, T(wi), . . . ,T{wr). 
We prove by induction on r that the density gr at the root of of the analogue of i^q' 
for is equal to 

The case r = k is the desired conclusion. 

To prove the r = 1 step, use (2.4) with vi = o, Ai = {o}, C\ = $,V2 = wi, A2 = T{wi) 
and C2 = C{wi). If wi G C, the r = 1 case is trivially true, so assume s > 1. The measure 
Hi is uniform on S since C{v) = 0. Thus from (2.4) we find that 

51 (y) = c j e-'^"^y^^^h{z)dz = {lCj,h){y) 

which proves the r = 1 case. 

For 1 < r < s, use (2.4) with Ai = F^-i, vi = o, Ci = F^_i nC, A2 = T(wr), V2 = Wr 
and C2 = T{wr) n C. Using (2.4) we find that 

= cgr-i{y){JCjJr){y) 

= {(i){9r-l,K,jJr)){y) . 

By associativity of O the induction step is completed for r < s. 

Finally, if r > s, then the difference between H{ri) on F^-i and H{r}) on F^ is just 
—JrH{r]{o),6{wr)), so 

9r{y) = cgr-l{y) ew{-JrH{y, 6{Wr))) = {Q{gr-l,K^Jrfr)) {v) 

by the convention (1.5), and associativity of Q completes the induction as before. □ 

Another consequence of Lemma 2.1 is Lemma 1.1, giving the existence of a natural 
and well defined free boundary measure. 

Proof of Lemma 1.1. Observe that in (2.3), if /2 = 1 then the integral against z is 
independent of y, so one has ijS^^ = fii. Let F be any cutset and w € be chosen so 
each of its children vi, . . . ,Vk is 'm F. Applying our observation inductively to eliminate 
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each child of w in turn, we see that the projection of jJp'^ onto S^'^'^"'-'^ is just /JpT where 
F = Fu{w}\{vi,...,Vk}. 

Given cutsets C and D with DOC^ ^ ^, choose v e DDC^ and w >v maximal in C*. 
Then all children of w arc in C. Applying the previous paragraph with F = C, we see 
that fifif^ agrees with /i^?'^. Continually reducing in this way, we conclude that on fl-D' 
^(^^ agrees with fiq^'^ where Q is the exterior boundary of C* H D*. The same argument 
shows that fi^^^ agrees with /Xq®^, which finishes the proof of the lemma. □ 

According to Lemma 2.2, if, for J > 0, we define V{J) to be the smallest class of 
densities containing each Kj' y for J' G (0, J] and y G S and closed under /Cj/ for J' G (0, J] 

T S 

and Q, then, when J' is strictly positive and bounded by J, each density f^^ is an element 
of V{J)- Similarly, if V+{J) is taken to be the smallest class of densities containing 
each Kji for J' G (0, J] and closed under Kji for J' G (0, J] and Q) then, when J is 

T -U 

strictly positive and bounded by J, each density f^'^ is an element of V+{J). We also let 
V ■= {Jj>onJ) and r+ := Uj>o^+(^)- 

This leads to the following lemma whose proof is left to the reader. 

Lemma 2.3 Suppose the interaction strengths {i7(e)} are bounded above by some con- 
stant. Then there exist constants < -Bmin < -Bmax such that for every C, S and v G C*, 
the one- dimensional marginal of /Iq at v is absolutely continuous with respect to dx with 
a density function in [-Bmin, Smax]- It follows, since the above properties are closed under 
convex combinations, that all one- dimensional marginals of any Gibbs state have densi- 
ties in [Smiii; -Smax]- Similarly, the k-dimensional marginals have densities in the interval 
[-^min' -^max] for some Constants < -B^j^ < B^lx- In addition, the family of all one- 
dimensional densities which arise as above is an equicontinuous family. 

The usefulness of the cquicontinuity property is that the following easily proved lemma 
(whose proof is also left to the reader) tells us that in determining weak convergence to 
dx, it is equivalent to look to see if there is convergence in L°° of the associated densities 
to 1. 

Lemma 2.4 Let {X, d) be a compact metric space and fi a probability measure on X with 
full support. If {fn} is an equicontinuous family of probability densities (with respect to 
ji), then 

lim ll/n — l||oo = ^/ O'f^d only if lim fndfJL = ji weakly . 
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Using this, we can prove the equivalence of phase transition and symmetry breaking 
on trees (Proposition 1.3). 

Proof of Proposition 1.3. (i) implies (ii) is trivial. For (ii) implying (iii), assume we 
have a vertex v, a sequence of cutsets C„ — > oo and boundary conditions Sn on C„ such 
that 

inf||/^:,.-l|Uf^O. 

Clearly we obtain the same result if we change Sn on C„ \ T{v) to anything, in particular, 
if we take no (i.e., free) boundary condition there. We then take any weak limit of these 
measures as n — > oo. This will yield a Gibbs state and by the first line of the proof of 
Lemma 1.1, together with Lemma 2.4, the marginal density at v of this Gibbs state is not 
1, which proves (iii). (iii) implies (iv) is also trivial of course. To see that (iv) implies (i), 
note that if there is PT, then there exists an extremal Gibbs state fi ^ /J^^^. Choose a 
cutset C such that fi ^ /J^^^ when restricted to C*. If (i) fails, then for all -y G C, there 
exists a sequence of cutsets C„ ^ oo such that for all boundary conditions Sn on Cji WG 
have that 

inf||/fe,.-l||oo = 0. (2.6) 

Clearly, because of the geometry, {Cn} can be chosen independent of v. Since fi is extremal, 
it is known (see Theorem 7.12(b) in [10], p. 122) that there exist boundary conditions S'n 
on Cn so that /j.^^ /j, weakly. However, by (2.6) and Lemma 2.2, /x must equal fi ^^^ on 
C, a contradiction. □ 

2.2 Distance regular spaces 

Our primary interest in this paper is in the Heisenberg models. Nevertheless, it turns out 
that many of the properties of the Heisenberg model hold in the more general context of 
distance regular spaces. A distance regular graph is a finite graph for which the size of 
the set {z : d{x, z) = a, d{y, z) = b} depends on x and y only through the value of d{x, y) 
where d{x, y) is the usual graph distance between x and y. We generalize this by saying 
that the metric space (S, d) with probability measure dx is distance regular if the law 
of the pair {d{x, Z),d{y, Z)) when Z has law dx depends only on d{x,y). In particular, 
when the action of G on S is distance transitive (in addition to preserving d and dx), 
meaning that (x, y) can be mapped to any (x', y') with d{x, y) = d{x', y'), it follows easily 
that (S, d, dx) is distance regular. All the examples we have mentioned so far are distance 
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transitive (and hence distance regular) except for the rotor model which is still distance 
regular. (For an example of a graph showing that the full automorphism group acting 
distance transitively is strictly stronger than the assumption of distance regularity, see [1] 
or Additional Result 23b of [5].) 

We present some of the background in this generality not because we are fond of 
gratuitous generalization but because we find the reasoning clearer, and because it seems 
reasonable that someone in the future might study a particle system whose spin states 
are elements of some distance regular space, such as real projective space or the discrete 
n-cube. The primary consequence of distance regularity is that it allows one to define a 
commutative convolution on a certain subspace of L^. 

Definition 9 Let -L^(S) denote the space L^((ix), and let L^(S/0) denote the space of 
functions f G -^^(S) for which f{x) depends only on d{x,b). For f G L^(S/0), define a 
function f on {d{b,y)}y^s by fir) '■= f{x) where x is such that d{0,x) = r. 

Definition 10 //(S,(ix) is distance regular, define a commutative convolution operation 
on L2(S/6) X L2(S/6) by 

f*h{x):= / h{y)f{d{x,y))dx{y)= / f{u)h{v)d-Kx{u,v) 

JS ■/[0,oo)2 

where tt^ is the law of {d{x, Z),d{Q, Z)) for a variable Z with law dx. It is clear from 
the definition of a distance regular space that {d{x, Z),d{0, Z)) and {d(p, Z),d{x, Z)) are 
equal in distribution implying that f * h = h* f and that, since tTj, only depends on d{x,0), 
f,he L2(S/6) implies that f*heL^{S/6). 

The following lemma is straightforward and left to the reader. 

Lemma 2.5 For all J > 0, Kj e L^{S/6) and for all h G L^{S), lCj{h){x) (defined 
in (1-4)) is equal to J^h{y)Kj{d{x,y))dx.{y). In particular, if {S^dx) is distance regular, 
then the operators Kj map L^(S/0) into itself and ]Cj{h) = Kj * h for all h G L^(S/0). 

We believe that for most distance regular spaces, one can verify the necessary hypothe- 
ses of Theorems 3.1 and 3.2 below in the same way as we will do for the Heisenberg models 
in detail in the next section. Doing this however would take us too far afield and so we 
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content ourselves with pointing out to the reader that much of this probably can be done, 
and after analyzing the Heisenberg models in Section 2.3, explain how to carry much of 
this out in the context of distance regular graphs in Section 2.4. 



2.3 Heisenberg models 

In this subsection, we consider Example 4 in Section 1 and so we have S = 5*^, d > 1, 
the unit sphere in (d + 1) -dimensional Euclidean space with the corresponding G, d, dx 
and H. Recall that this is distance transitive for d > 2 (and hence distance regular) and 
distance regular for d = 1. The following lemma allows us to set up coordinates in which 
our bookkeeping will be manageable. It is certainly well known. 

Lemma 2.6 For any d > 1, there exist real-valued functions 'tpo,ipi,'tp2, ■ ■ ■ G -L^(S/0) 
(S = S"^), orthogonal under the inner product {f,g) = Jg fgdvi, such that tp^ is a polyno- 
mial of degree exactly n in x ■ 0, and such that the following properties hold. 

(1) ipo{x) = 1 and ipi{x) = x ■ 0. 

(2) 1 = tpjiO) = sup^gs li^jixM, for all j. 

(3) ipiipj = J2r>o Qij'^Pr , where the coefficients qlj are nonnegative and J^rQij — ^■ 

(4) tl^i * = Ij^iji^j, where := V'j * -0^(6) = li'jix) dx{x). 

( 5) The functions ipj are eigenf unctions of any convolution operator, that is, f = cipj 
for anyfeL\S/6). 

(6) Any f G L^(S/0) can he written as a convergent series f{x) = Y^j>Q o,j{f)ipjix) (in the 
sense), where the complex numbers aj{f) are given by aj{f) '■= f f{x)'^j{x)dx.{x). 

(7) For f,ge L'^{S/6), we have aj{f * g) = ljaj{f)aj(g). 

Proof. For each a,j3 > —1, define the Jacobi polynomials {^^n'^\r)}n>o by 

(_-[\n fjn 

(1 _ + r)^Pt^\r) = [(1 - + r)"+^] . (2.7) 

(The Jacobi polynomials are usually defined differently in which case (2.7) becomes what 
is known as Rodrigues' formula but we shall use (2.7) as our definition; when a = (3, which 
is the case relevant to us, these are the ultraspherical polynomials.) 



For any given d > 1, we let, for n > 0, 

p;.^ ^'^ "(x-o) 



Pi^ " \l) 
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By p. 254 in [21], P„ ' is a polynomial of degree exactly n. By p. 259 in [21], the collection 
{Pn''^^}n>o Si.Te ortliogonal on [—1, 1] with respect to the weight function (1 — r)°(l +r)^. 
A change of variables then shows that the tp^s are orthogonal in L'^{S). 

(1) is then an easy calculation, the first equality in (2) is trivial while the second 
equality is in [21], p. 278 and 281. (3) is in [3], p.41. (4) and (5) follow from the Funk-Hecke 
Theorem ([17], p. 195) (the calculation of 7j being trivial). Since the subspace generated by 
the {Pn^ ('")}'s are uniformly dense in C([— 1, 1]) by the Stone- Weierstrass Theorem, 

it easily follows that the subspace generated by the V'n's are uniformly dense in L^(S/0) fl 
(7(S). Hence the tp^s are a basis for L^(S/0) and (6) follows. Finally, (4) and (6) together 
yield (7). □ 

Note that for all f,g e L^{S/6), we have that fg G L'^{S/6) provided fg G L^{S). 
Since ipn is a polynomial of degree exactly n in x • 0, the greatest r for which qlj ^ must 
hei+j. Prom this and the nonnegativity of the g[j's, it follows that for A > the function 
gAVi(x) ^ ^„>oA"Vi(a;)"/n! has 

aj{e^^^) > 0, for aU j > 0. (2.8) 

It follows from Lemmas 2.2, 2.5 and 2.6(3,4) that V+ C L'^{S/6) and that for all g G V+, 

aj{g) > 0, for ah j > 0. (2.9) 

Definition 11 Define the A norm on L^(S/0) by 

ll/IU = EK(/)l' 

provided it is finite. 

Prom the fact that J2r>o'lij = 1> '^^^ can easily show that for all /, 5 G L^(S/0) with 
fg G L2(S/6), 

ll/5lU<||/IUIblU, (2.10) 

and that equality holds if /, 5 G V+. An easy computation also shows that | je'**'^!''^-' ||a = 
C** < 00 for all A > and hence by Lemmas 2.2 and 2.6(4) and (2.10), < 00 for 

all / G V^. Also, it follows from (2.9), Lemma 2.6(2,6), the fact that //dx = 1 for all 
f €V+ and the fact that V+ C L'^{S/6) that for f €V+, 

1 + 11/ - 1|U = ll/IU = /(6) = ll/lloo = 1 + 11/ - llloo. (2.11) 
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The last equality is obtained by observing that < is clear while H^Hoo ^ llslU 
g G L^(S/0) is also clear. 



oih) 



Lemma 2.7 There exists a function o with lim 

h-*o h 



such that for all hi, . . . ,hk &V-{ 



with k < B, 

k 

WQihi, ...,hk)-l- J2ihi - 1)|U < o{max\\hi - 1|U), 
provided maxj \ \hi — 1\\a < 1- 



1=1 



(2.12) 



Proof. Write 



k k 

i[hi-i-^{h,-i)\\A=\\ E n(^^-i)iu- 

1=1 1=1 Ac{i,...,fc} ieA 

\A\>2 



(2.13) 



Then maxj — 1|| < 1 and submultiplicativity (2.10) of || • ^ implies this is at most 

2*^(max||/ii-l|U)2. 

i 

Next, since /(/ij — 1) dx = for 1 < i < k, we similarly obtain 

k 



< 2''(max||/ii - 1|U) 



We then have 



110(^1,. ..,%)-n^iiu 



1=1 



i=l 



n/i,|U<4*^(max||/ii-l|U)', 



i=l 



since || HiLi ^i|U ^ and / HiLi > 1 by the positivity of the qlj and (2.9). A use of 
the triangle inequality completes the proof. □ 

We note five facts that follow easily from the above, but which will be useful later 
on in generalizing our results. Let be the linear subspace of L^(S/0) spanned by 

7^+, {V+{J)) be the linear subspace of L^(S/0) spanned by V+{J) and ||/Cj/m denote the 
operator norm of Kji on ((P+), || ||a)- 



jim ||i^j,-l|U = 0; 



(2.14) 
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ci := sup ^ — ^jj^ < oo; (2.15) 

mr+),m 11/ - 1|U 



02-.= inf ^'|'r >0; (2.16) 



ForallJ'>0, ||/Cj/|U < 1; (2.17) 
There exist a, 6 € S such that for all / G V+, 

f{a) = sup fix) and /(6) = inf fix). (2.18) 

(2.17), for example, follows immediately from Lemmas 2.5 and 2.6(7) and the fact that 
llnanig)] < 1 for any probability density function g G L^(S/0). 

The results on Heisenberg models presented thus far are parallel to the results obtain- 
able for any finite distance regular graph (see the next subsection) . One useful result that 
is not true for general distance regular models depends on the following obvious geometric 
property of the sphere: 

\{z : dix,z) < a,diy,z) < b}\ 

is a nonincreasing function of ci(x,y) for any fixed a and b where | | denotes surface 
measure. [Proof: For S^, this is obvious. For S"^, d > 2, by symmetry, we can assume 
that X = (0, . . . , 0, 1) and y = (cos 9,0,. .. ,0, sin 9) (both vectors with d+ 1 coordinates). 
Write S"^ as 

where 

Au := 5"'n {(ai,... ,Od+i) : (02,... ,ad) = u}. 

Each Au is a circle (or is empty) and so essentially by the 1-dimensional case, we have the 
desired behaviour on each A^ (using 1-dimensional Lebesgue measure) and by Fubini's 
Theorem, we obtain the desired result on 5"^.] 

Calling a function / G L^(S/0) nonincreasing if the corresponding / is nonincreasing, 
the latter can be seen to be equivalent to the property that 1^^^. q-j<^ * l^^^,^. 5^^^ is nonin- 
creasing, and by taking linear combinations, this is equivalent to f *g being nonincreasing 
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for all nonincrcasing / and g in L^(S/0). Since Kj is nonincreasing for all J, it follows 
from the fundamental recursion that 



/ G Vjf- =^ / is nonincreasing. (2-19) 
Lemma 2.8 For any positive nonincreasing f G L^(S/0), 

Js Js 

for all n > 1 . 

Proof. It suffices to prove this for functions of the form f{x) = i^x-6>t} ^^^^ ^ ^ ["■*-' -*■]• 
We rely on explicit formulae for the functions {tpn}- Letting a = d/2 — 1, a change of 
variables yields 



'(1) 

where 

Sd = j {l-r''Ydr 

and Pn"'"'* is the Jacobi polynomial defined earlier. 

Taking the indefinite integral of each side in (2.7) with (3 = a yields 

{l-rr{l+rrPt''\r)dr = L_L__ [(l _ ^)n+a(i + ^)n+a] 



Evaluating at 1 and t gives 



1 f^Pt'^ HrKl-rY 

{a,a) 



dr 



When n = 1, using (2.7), this is just s^^l - t2)°+i/2(l + a). Dividing, we get 
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The first term in the product is bounded in absolute value by 1. By [3], p. 7, 

and so we see that the second term is l/(a + 1), completing the proof of the lemma. □ 
Remark: The case d = 1 can also be handled by a rearrangement lemma. 

Definition 12 Define a linear functional L on -L^(S/0) by L{g) := g(x)'>pi{x)dx{x) 
(= 7iai(5()) and set Opj = L{Kj). (Recall that tpij'ji and ai are defined in Lemma 2.6.) 

It follows from Lemmas 2.5, 2.6(7) and 2.8, (2.19) and an easy computation that 

\\ICjf - 1|U < OpjWf - 1|U for all / G P+{J). (2.20) 

In the following inequalities, we denote p := Opj. For / G V+{J), it also follows easily 
that 

L{lCjf - 1) > pL{f - 1) (2.21) 
and that there is a constant C3 such that for all / G ('P+(J)), 

l^(/)l<C3||/IU. (2.22) 

(We can of course take C3 to be 1, but we leave the condition written in this more general 
form for use as a hypothesis in Theorem 3.2.) 

Putting together the results of Lemmas 2.6 and 2.8, as well as (2.8), (2.9) and (2.19), 
gives the following corollary. 

Corollary 2.9 For all J >0, there is a constant C4 > such that for all f G V+{J), 

L{f) > C4\\f - 1\\a. (2.23) 

Proof. Fix / G V+{J). If / = Kji for some J' G (0, J], we argue as follows. As 
lle^V-iCa:)!!^ ^ gA (^hich we mentioned earlier) and Kji{x) = e"^'^^^^^ / J e'^'^^^^^d-x{x), we 
have 

Ili^j.-ilU = ll^j'IU-1 

. ^ 1 

jeJ'i^i(x)^x(a;) 
< e^-^'-l. 
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Next, 



1 g (J' 

k=0 

By Lemma 2.6(3), all terms in the sum are nonnegative and by Lemma 2.6(4), the A; = 1 
term is J'71. Hence L{Kji) > J'^xje^' . Since 



J'71 

inf , y, 2 J, -T > 0, 

J'e(o,J] e-' (e^-^ - 1) 



we can find a C4 in this case. 



Otherwise, by the fundamental recursion, we may represent / as 0(/Cj^/ii, . . . , ICj^hk) 
with each hi either in V+^J) or equal to 5^ and each Ji G (0, J\. Define gi = JCj^hi — 1. Let 
m := mio^ji<j ai{Kj/)/J2n>o(^niKj') which is strictly positive by the above. It follows 
that if hi G V+{J) (the case hi = 5^ is already done), 

L{gi) ^ ai{K,j^)ai{hiYil ^ 
(i^jja„(/ii)7„ 

by Lemma 2.6(7) and since ai(/ij)7i > o,n{hi)^n for all n > 1 by Lemma 2.8 and (2.19). 
Let h = nil ICjM- Then L{h) = L{1 + E^i 9t + Q), where Q is a sum of monomials in 
{gi}- Using > and (2.9), we have that L(Q) > 0, and hence 

k k 

L{h)>J2 H9i) >mjiJ2\\9i\\A- (2.24) 

i=i i=i 

On the other hand, for any B and M, there is C = C(M, B) such that if xi, . . . , G (0, M) 
with k < B, then 

-i+n(i+a^i) <cE^i- 

i=l 1=1 

Next, the positivity of the q^j implies /g h{x) dx(x) = ao(/i.) > 1. It follows that 

A: k 

\\h-l\\A = -l + \\h\\A = -l + ll\\gi + l\\A<Cj2\\gi\\A 

1=1 1=1 

for some constant C since H^j + im = ll^jm + 1 and Hgij + im clearly has a universal 
upper bound. [To see the latter statement, one notes that 

sup \\Kj,\\a< 00, 

0<J'<J 



25 



||i^j'*/IU< Ili^j'IU 

for any probability density function / G L^(S/0) (by Lemma 2.6(7)), (2.10) and the fact 
that we never have more than B terms in our pointwise products imply that 

sup ||/|U<( sup Ili^j'IU < 
fer+{J) \o<J'<J J 



Putting this together with (2.24) gives 



h-l\\A - C 



Finally, letting f = h/ (/g h{x) (ix(x)), we obtain 
\\h-l\\A > 

n>l 



E 

n>l 



h{x)dx{x) 



anif) 



/ h(x)d'x.(x) 
Js 



Hence 



and we're done. 



L{f) ^ L{h) ^ mji 

- C 



II/-1IU - \\h-l\ 



□ 



2.4 Distance regular graphs 

For the remainder of this section, we suppose that S is the vertex set of a finite, connected, 
distance regular graph, that d{x,y) is the graph distance, and that the energy H(x,y) 
depends only on d{x,y). The Potts models fit into this framework, with the respective 
graphs being the complete graph Kg on q vertices. All the results we need follow in 
fact from an even weaker assumption, namely that S is an association scheme. For the 
definition of association schemes and the proofs of the relevant results, see [4] or [23]. By 
developing the analogue of Lemma 2.6 for distance regular graphs, we will illustrate the 
extent to which our results are independent of the special properties of the Heisenberg 
model. 
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Wc have a distinguished clement S S and the measure dx will of course be normalized 
counting measure |S|^^ ^^^g (5^:. The spaces L'^{S) and L^(S/0) are then simply finite 
dimensional vector spaces with respective dimensions |S| and 1 + D, where D is the 
diameter of the graph S. 

Denote by M(S) the space of matrices with rows and columns indexed by S, thought 
of as linear maps from L^(S) to L'^{S). Associated with each function / G L^(S/0) is the 
matrix Mf G M(S) whose {x,y) entry is f{d{x,y)), whence the matrix Mf corresponds to 
the linear operator h h * f given in Section 2.2. The following analogue of Lemma 2.6 
is derived from Section 2.4 of [23]; a published reference is Section 2.3 of [4]. 

Lemma 2.10 There exists a basis of real -valued functions tpQ, . . . , ipD o/L^(S/0) orthog- 
onal under the inner product {f,g) = |S|~^ J2x fi^)9i^) following properties. 

(1) Mx) = 1. 

(2) i;j{6) = 1 = sup^ |V'i(x)| for all j. 

(3) = Y^^=Q Qiji^r for some nonnegative coefficients qlj with J2r lij ~ 

(4) ipi * ijjj = 7jSijtpj, where 7^ := iJjj * ijjjiO) = |S|"i Hx^'ji^)'^ ■ 

(5) The functions tpj are eigenf unctions of any convolution operator, that is, Mfipj = ctpj 
for any f € 1.2(8/6). 

(6) For f G L2(S/6), we have f = Ef=o «i(/)V'i; where aj := 7^"^|S|"^ /(^)V'j(a^)- 

(7) For f,ge L^{S/b), we have aj{f * g) = -fjaj{f)aj{g). 

(8) For f G L2(S/0) which is positive and nonincreasing, |(/, V'i)l ^ (/; V'l) fof each i>l. 

If we place the norm 5^jLol^i(/)l on (7^+ (J)), essentially all of the hypotheses in 
Theorems 3.1 and 3.2 (to come later) are immediate noting that all norms are equiva- 
lent on finite dimensional spaces. If the analogue of (2.19) holds, then letting L{g) : = 
\S\~^^^^^g{x)il)i{x) and both Opj and p to be L{Kj), then one can easily show that 
all of the hypotheses in Theorems 3.1 and 3.2 hold. As far as (2.19), it trivially holds for 
the complete graph where the diameter D is equal to 1 and in any case, the reader is left 
with only one condition to check. 

3 Two Technical Theorems 

We now state two general results from which Theorems 1.6 and 1.13 will follow. 



27 



Theorem 3.1 Let T be any tree (with bounded degree). For the d-dimensional Heisenberg 
model with d> 1, if J > and 

5r(r) • Opj < 1, 

then there is no robust phase transition for the parameter J, where Opj is given in Def- 
inition 12 fOpj implicitly depends on d). More generally, if J > and if {S, G, H) is 
any statistical ensemble with a norm \\ • \ \ on {V+{J)) satisfying (2.12), (2.14), (2.15) 
and (2.17) and there exists a number Opj € (0, 1) satisfying (2.20) and 6r(r) • Opj < 1, 
then there is no robust phase transition for the parameter J. 

Theorem 3.2 Let T be any tree (with bounded degree). For the d-dimensional Heisenberg 
model with d> 1, if J > and 

br{r) ■ Opj > 1, 

then there is a robust phase transition for the parameter J, where Opj is as above. More 
generally, if J > and if {S,G,H) is any statistical ensemble with a norm \\ ■ \\ on 
{V+{J)) satisfying (2.12), (2.15), (2.16), (2.17) and (2.18), and if L is a linear functional 
on {T'+{J)) which vanishes on the constants and satisfies (2.21), (2.22) and (2.23) for a 
constant p > 0, then br{T) ■ p > 1 implies a robust phase transition for the parameter J. 

To prove these results, we begin with a purely geometric lemma on the existence of 
cutsets of uniformly small content below the branching number. 

Lemma 3.3 Assume that br{T) < d. Then for all e > 0, there exists a cutset C such that 

x&C 

and for all v E C UC, 

E (^)l^l-l^l < 1- (3.1) 
xeCnT{v) 

Proof. Since br{T) < d, for any given e > 0, there exists a cutset C such that 

We can assume that C is a minimal cutset with this property with respect to the partial 
order Ci ^ C2 if for all v E Ci, there exists w E C2 such that v < w. We claim that this 
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cutset satisfies (3.1). If this property failed for some v, we let C be the modified outset 
obtained by replaeing C fl T{v) by v (and leaving C fl unchanged). As (3.1) clearly 
holds for w £ C, we must have that v ^ C in which case C ^ C. We then have 

< E (^)''' + (^)''' E (^)'''~'' 

xeCnr{v)^ xeCnr{v) 



'''''' 

x&Cnr{v)^ a;eCnr(?;) 

1, 



xec 
< e, 

contradicting the minimality of C since clearly C' :< C. □ 

We now proceed with the proofs of Theorems 3.1 and 3.2. 

Proof of Theorem 3.1. Since in Section 2.3 the Heisenberg models have been shown 
to satisfy all of the more general hypotheses of this theorem, we need only prove the last 
statement of the theorem where we have a given J > 0, a given || || on {V+{J)) and a 
given Op J satisfying the required conditions. By (2.12), for any e > 0, there is an eg > 
such that for all A; < i? and all hi, . . . ,hk G V+{J) with \\hi — 1|| < eo for all i, we have 
that 

k 

||0,(/ii, . . . , /ifc) - 111 < (1 + e) ^ \\hi - 111 . (3.2) 

i=l 

Choose e > so that (1 + e)~^ > br{T) ■ Opj and choose eo as above. By (2.14), we can 
choose J' > small enough so that \\Kji — 1|| < eoOpj. Use Lemma 3.3 to choose a 
sequence of cutsets {C„} for which 



Jim ^[(l + e)Op^]N=0 

xeCn 

and for all n and all v E Li Cn, 



Y: [(l + 6)Op^]N-H<l. (3.3) 
xec„nr{v) 



29 



We now show by induction that for ah n and ah v € C^, 

Wf&i'' - 111 < ^0 E [(1 + ^)Opjp-^'^ . (3.4) 

xeCnnr{v) 

Indeed, from Lemma 2.2, letting wi,. . . ,Wk be the children of v, 

Wfcni'^ ~ ^11 = iiO(^^('/cnit' • • • '-'^j^'/itt) - iii 

where J" is J if G and J' otherwise. When Wi G C„, the choice of J' guarantees that 
W^J'-' fcn m ~ ^11 — ^oOpj < Co, while when Wi ^ Cn, the induction hypothesis together 
with (3.3) guarantees that H/^^J'^^^ — 1|| < cq which implies that ||^j"/c„'t)t ~ "'"H — 
by (2.17). Hence, from (3.2), 

ii/if -i||<(i+^) E ii^^'/c::,tt-iii + (i+^) E ii^^/£tt-iii- 

The summands in the first sum are at most eoOpj while those in the second sum 
are by (2.20) at most Opj||/(^^'^. — 1||. Therefore using the induction hypothesis on the 
second term, we obtain 

k 

ll/S^-i|| < E(i + ^)^oOp, E [(i+e)op,]i^i-w 

i=i xec„nr{wi) 

= eo E [(l + e)Op,]N-H, 

xec„nr{v) 

completing the induction. Finally, the theorem follows by taking v = o, letting n — > oo, 
and using (2.15). □ 

For the proof of Theorem 3.2, it is easiest to isolate the following two lemmas. 

Lemma 3.4 Under the more general hypotheses of Theorem 3.2 (with a given J > 0, a 
given \\ \\ on {Vj^{J)), a given L and a given p satisfying the required conditions) , for all 
a > 0, there exists P > so that if hi, . . . , € Vj^{J) with k < B and \\hi — 1|| < /3 for 
each i, then 

k 

L {{(DS^jhu • • • , K.jhk)) - ij > E ^i^jhi - 1) 

1=1 
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Proof. In (2.12), choose P < I so that 

o{h) <h(l ^ ^""^ 



(1 + a)/ C3 

for all h € (0, /?), with C3 and C4 as in (2.22) and (2.23). If hi, . . . ,hk € ■p+(J) are such 
that \\hi - 1|| < /3, then \\ICjhi - 1\\ < P by (2.17). We can now write 

1 ^ 

O^iJCjhi, JCjhk) - 1 - -— ^ - 1) (3-5) 

as 

1 ^ ' 



^-J^^)U^jh,-l) + U (3.6) 



i=l 



where by assumption, 

\\U\\ < o(max||/Cj/ii - 1||) (3.7) 

i 

< (l-—^ — - ) — max||/Cj/ij - 1|| 
V [l + a)J C3 I 

s (-^)Sgii^^'--* 

Letting a be the quantity (3.5), we see that 



L{a) = L 



1 ^ ' 



Y^^lCjhi - 1) 



+ L{U) 



{1 + <^)JU 

k 

{l + a)J ^ 



> 
> 

by (2.22), (2.23) and (3.7), which is the conclusion of the lemma. □ 

The next lemma tells us that in "one step" , we can't move from being "far away" from 
uniform to being "very close" to uniform. 

Lemma 3.5 Under the more general hypotheses of Theorem 3.2 (with a given J > 0, a 
given \\ \\ on (P+(J)), a given L and a given p satisfying the required conditions), for all 
(3 > and J' G (0, J], there exists a 7 < /? such that if ||Ofe(/Cj//^i, . . . ,/Cj//%) — 1|| < 7 
with hi, . . . ,hk G V+{J) U and k < B and with J" being J if hi E V+{J) and J' if 
hi = Sq, then each hi is not and Yli=i ll^i ~ 1|| < P- 
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Proof. Choose 7 G (0, min{/3, 1/ci}) so that 

2C1C3S7 



2ci7 



PC2C4(1 - C17) 

and 

min{||Kj-l||,||i^j.-l||}> 

(1 - Cl7)C2 

where ci, C2, p, C3 and C4 come from (2.15), (2.16), (2.21), (2.22) and (2.23) respectively. We 
first show that if ^1, . . . , % G 7^+(J), with k < B, then ||Ofe(/ii, . . . , hk) — 1|| < 7 < 1/ci 
impUes that for all i 

\\h^ - 111 < '''' 



(1 - C17)C2' 



h, - 111 < c^'Wh, - 1|U < c^-i (^^^ - 1) 
i/maxHi/ii t\ -1 /maxO;,(/ii, . . . 



[Proof: 



^ VminHi/ij / ^ VminOjfc(/ii, ■ ■ ■ ,^fe) 
where the second inequality is straightforward and the third inequality comes from (2.18). 
Next, ||Ofe(/ii, ■ • • , %) — Ijj < 7 < 1/ci implies ||Ofe(^i) • • • ) ^fe)~l||oo < C17 which implies 
the last expression is at most 

-1 / I + ci7 _ _ „_i 2ci7 



- I '-2 • 

ci7 / 1 — ci7 



It follows that if \\(Dk{^j{'hi, ■ ■ ■ ,/Cj^'%) — Ijj < 7, then 

2ci7 



\lCjiihi — 1\\ < 



(1 - C17)C2 

for each i which implies that hi G 'P+{J) (as opposed to being 5q). Hence J^'' is J for all i. 
Now from (2.21)-(2.23) we have 

WKijhi - 1|| > 

and we obtain the conclusion of the lemma. □ 

Proof of Theorem 3.2. Since in Section 2.3 the Heiscnbcrg models have been shown 
to satisfy all of the more general hypotheses of this theorem, we need only prove the 
last statement of the theorem, where we have a given J > 0, a given || || on {V-\-{J)), 
a given L and a given p satisfying the required conditions. Choose an a > so that 
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br{T) ■ p > 1 + a. Choosing /3 from Lemma 3.4, wc have, under our assumptions, that for 
all hi, . . . ,hk & 'P+{J) with k < B and — 1|| < /3 for each i, 

^ [iOk(>^jhi, Kjhk)) - l] > ^C^J^^ - 1) ^ TT~ ^ " ^^-^^ 

+ *^ i=l + 

Now, if there is no robust phase transition, then by (2.16) there must exist J' G (0, J] and 
a sequence of cutsets {Cn} going to infinity such that hm„_^.oo ||/c„'o ~ M\ = 0- Using 
Lemma 3.5, choose 7 < /3 corresponding to /3 and J'. Next, by our choice of a, we have 

\x\ 



c 5- Vi + ay 



where the infimum is over all cutsets. We now choose n so that 

\\fc:,o - 111 < min{7, — -}. 

where C3 and C4 come from (2.22) and (2.23) respectively. We then define F' to be the 
component of the set 

{^;eCM|/£^'+-l||<7} 

that contains o and let C be the exterior boundary of V (that is, the set of a; ^ F' 
neighboring some y G F'). By the choice of 7, C C and for each v E CUC, the density 

fc:,v ism 

V+{J)n{f:\\f-l\\</3}. 
Using (3.8) and induction, we see that 



^(/S"-i)>E(rf^)"^(/S 



-1). 



xec 

By definition of F', C and / and the fact that L{f — 1) > C4II/ — 1|| on V+{J), we see that 

^(/S^ - 1) > C47^- 



Hence 



ii/c:'o'"-iii>|7/. 



This contradicts the choice of n, proving that there is indeed a robust phase transition. □ 



33 



4 Analysis of specific models 



4.1 Heisenberg models 

For the Heisenberg models, recall that S = S"^, d> 1, and H{x, y) = —x ■ y. The operator 
/Cj is convolution with the function Kj{x) = ce"^^"°, where c is a normalizing constant. 

Proof of Theorem 1.6. A change of variables shows that L{Kj) = p'^{J) and so the 
result follows from Theorems 3.1 and 3.2. □ 

For the rotor model, we now prove the equivalence of SB and SB+. 

Proof of Proposition 1.4. We have already seen the representation 

n>0 

for functions / G L^(S/0). In the case of the rotor model, where 8 = 5'^ and we take 
to be (1,0), the space L^(S/0) is the space of even functions of ^ G [— tt, tt] and ipn = 
cos(n6). We now turn to the full Fourier decomposition / = ?'n(/)e'"^, where 

Let C be any cutset and 5 be a set of boundary conditions on C. Let JT" be any set of 
interaction strengths. It suffices to show that 

Wfc^w ~ M\oo < Wfc^w ~ M\oo 

for all w G C*. For v E C and n E let Xy^n = bn{Kj(^x),S(v)) where e is the edge from v 
to its parent. 

Claim: For all y G C*, the Fourier coefficients Jq^ e^""^ dfi'^'y{6), which we denote by 
{uy^n '■ n- € Z}, are sums of monomials in {x^^n}v&c,n<^z with nonnegative coefficients. 
Proof: Let w ^ C"^ have children wi^ . . . ,Wr G C* and iv^+i, . . . , G C. Then the Fourier 
coefficients {uw,n ■ n ^} are the convolution of the k — r series {x^^n '■ n € as v 
ranges over Wr+i, . . . ,Wk, also convolved with the series {hn{Kj(^-^)uy^n '■ n G as v 
ranges over wi,. . . ,Wr- Since bn{Kj) > 0, this establishes the claim via induction and the 
fundamental recursion. 

Now write for the Fourier coefficients 6„(i^j-(e)) where e is as before. Since 
Kj^^ic,{x) = Kj{e~^"x), it follows that 
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But x^^j^ is real because Kj is even, and has been shown to be nonnegative. Thus 

and it follows from the claim that each 

ii'w^n hcis modulus bounded. cibov6 by th.6 corre- 
sponding ^ when plus boundary conditions are taken. Hence 

Wf^i - llloo < Wf^i - 1|U < E < E <n = - 1|U = - llloo, 

proving the lemma. □ 

Remark: Although we have used special properties of the Fourier decomposition on L?{S^), 
there exist similar decompositions for S"^. We believe that a parallel argument can prob- 
ably be constructed, bounding the modulus of the sum of the coefficients of spherical 
harmonics of a given order by the coefficients one obtains for the analogous monomials in 
the values an{Kj^^-^), whose coefficients are necessarily nonnegative by the nonnegativity 
of the connection coefficients qly Thus we are led to state: 

Problem 4.1 Prove a version of Proposition I.4 for general Heisenberg models on trees. 



4.2 The Potts model 



Proof of Theorem 1.13. We will obtain this result from Theorems 3.1 and 3.2. For 

(i) , letting || || be the L^o norm on and Opj = aj, all of the hypotheses in 
Theorem 3.1 except (2.20) are clear. The function Kj is given by 

Kj{x) = cexp(J(25^,o - 1)) 

where c = {e'^ + (<Z ~ l)e^'^)^^. The operator JCj is linear and 

ICjSj = ce'^Sj + E ce~^ 5i . 

Hence in the basis (5o, • • • , the matrix representation of /Cj is c(e^ — e~"^)/ + ce~'^ M 
where M is the matrix of all ones. On the orthogonal complement of the constant functions, 
Kj is c(e-^ - e--^)/, and (2.20) follows, proving (i) by an application of Theorem 3.1. 

For (ii), let || || be the same as above, p = aj and L{h) = h{0) — h{l). It is then 
immediate to check that all of the hypotheses in Theorem 3.2 hold and we may conclude 

(ii) by an application of Theorem 3.2. □ 
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5 Proof of Theorem 1.10. 



By Proposition 1.3 and the fact that any subtree of a tree with branching number 1 also 
has branching number 1, it suffices to show: 

For for any T with 6r(r) = 1, and any bounded J, there is a sequence of 
cutsets {Cn} such that for any sequence {6n} of boundary conditions on {C„}, 

lim \\f^'^" -1\\^=0. 

It is convenient to work with a different measure of size, the Max/Min measure, defined 
as follows. (This arose already in the proof of Lemma 3.5.) For any continuous strictly 
positive function / on S, let 

max^gs/(a;) 

It is immediate to see: 

Lemma 5.1 For any sequence {hn} of continuous probability densities, \\hn — l||oo 
if and only if log ||/in||M — 0. 

Next, we examine the effect of /Cj on ||/||m- 

Lemma 5.2 For any statistical ensemble {S,G,H), any Jmax and any T > there is an 
e > such that for any continuous strictly positive function f with ||/||m < T, and any 

log ||/Cj/||M<(l-e) log II/IIm . 

Proof. Fix H, J and / and assume without loss of generality that J f dx = 1 since the 
Max/Min measure is unaffected by multiplicative constants. Let [a, b] be the smallest 
closed interval containing the range of / and [c, d] contain the range of Kj with a, c > 0. 
Since / is a probability density, a < 1 < 6 (we rule out the trivial case / = 1). Since 
Kj = c + (1 — c)g for some probability density g, it follows that for any x G S, 

c + (1 - c)a < /Cj/(x) < c + (1 - c)b. 

As J varies over [0, </m£ix]! miiia; Kj{x^ is bounded below by some cq > 0, so for all such </, 

Co + (1 - co)a < KLjf{x) < cq + (1 - co)6 



36 



and so 

Co + (1 - coja 
Setting i? = ||/||m — 1, we have 6 = (1 + K)a and so 

lir fll ^ co + (l-co)(l + ii:)a (1 - co)a ^ , , p,, . 

/Cj/ M < — T = 1 + ti — — < 1 + - Co) . 

Co + (1 - Coja Co + (1 - Coja 

Thus 

||/Cj/||m<1 + (1-co)(||/||m-1) . (5.1) 

The function log(l + (1 — cq)u) / log(l + u) is bounded above by some 1 — e < 1 as n varies 
over (0,T — 1], and setting u = ||/||m — 1 in (5.1) gives 

log ||/Cj/||m < log(l + (1 - co)(|i/||M - 1)) < (1 - e) log II/IIm , 

proving the lemma. □ 

Proceeding with the proof of Theorem 1.10, let C be a cutset with no vertices in the 
first generation, 

dC = {v -.Iw (^C with V w}, 
and 6 be defined on C. Clearly, for continuous strictly positive functions hi, . . . ,hk, 

k 

||Q(/ll, . . . , /iA;)||m < n \\^i\\M ■ 
i=l 

We have also previously seen (Lemma 2.3) that all densities that arise are uniformly 
bounded away from and oo and hence there is a uniform bound on the || \\m that arise. 
We can therefore choose e from Lemma 5.2. Next for any v E \ dC, applying the 
fundamental recursion gives 

log 1 1 fc^y 1 1 M = log 1 1 C^JivwT) fc^wi ' • • • ' ^J{vw^) fc^v 



< X] log \\K^J{vw^)fc,li \\M 
1=1 



< X(l-6)log||/g;4||M. 
i=l 

Working backwards, we find that for any cutset C, 

iogii/g;„'iiM< E (i-e)i'"'iogii/g;;fiiM. 

wedc 
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Since br{T) = 1 one can choose a sequence of cutsets {Cn} such that X)«)eac„(l ~^)'^' ~^ 0- 
The uniform bound on ||/(;^||m imphes that for any sequence of functions 6n on C„, 

which along with Lemma 5.1 proves the theorem. □ 

OUe Haggstrom pointed out to us that this result could also be obtained using ideas 
from disagreement percolation. 

6 Proof of Theorem 1.14. 

While we assume that q is an integer, the case of nonintegral q can be made sense of via the 
random cluster representation, and it is worth noting here that the break between q = 2 
and q = 3 happens at = 2 + e. See [11] for a discussion of the qualitative differences 
between the random cluster model on a tree when q <2 as opposed to q > 2. 

Lemma 6.1 Assume that all of the hypotheses of Theorem 3.1 are in force (in particu- 
lar, (2.20) and br{T) ■ Opj < 1 hold and so there is no RPT for the parameter J) and 
in addition that sup^gg < oo and (2.20) holds for all f £ ^{J) (instead of just 

V+{J)). Then there is a tree T' with hr{T') = hr{T) such that V has no PT for the 
parameter J. 

Proof. We mimic the proof of Theorem 3.1. Choose e, eo and cutsets {C„} as in the 
proof of Theorem 3.1 where we can assume that the cutsets {C„} are disjoint. Choose an 
integer m sufficiently large so that the m-fold iterated convolution operator /Cy satisfies 
II^T'^y ~ 1|| ^ ^oOpj. For each increasing sequence {n{k) : A; = 1, 2, . . .} of integers, 
define a tree V by replacing each edge from an element of C„(;t) to its parent by m edges 
in series, for all cutsets in the sequence {Cn[k)}- It is not too great an abuse of notation to 
let Cn denote the cutset of V consisting of the same vertices as before. It is now possible 
to establish (3.4) for all v E D, where D is the set of vertices in T' that are in C* and in 
r (i.e., are not in a chain of parallel edges that was added). The only adjustment in the 
proof is as follows. Use Lemma 2.2 to represent fc^^ in terms of fc^^, where w are the 
children of u in F rather than in F', i.e., we leap the whole chain of m edges at once. Then 
the case w e Cn that was handled by the choice of J' is replaced by a case w eT'\T, which 
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is handled by the choice of m. In fact, (3.4) holds when + is replaced by any boundary 
condition as the exact same proof shows. By choosing {n{k)} sufficiently sparse, we can 
ensure that br{T') = br{T). Fixing any such choice of {n{k)}, it follows that there is no 
phase transition by the above together with Proposition 1.3. □ 

We proceed now with the description of a counterexample. For Fi, we choose the ho- 
mogeneous binary tree, where each vertex has precisely 2 children. Recall from Section 4.2 
that under + boundary conditions, the functions f^!^ all lie in a one-dimensional set. The 
most convenient parameterization for the segment is by the log-likelihood ratio of state 
to the other states. Thus the probability measure a5o + X]i=i ((1 ~ ^)/('? ^ is mapped 
to the value log[((/ — l)a/{l — a)]. Let g{v) denote the log-likelihood ratio at v under some 
interaction strength and boundary conditions. The recursion (2.5) of Lemma 2.2 boils 
down to 

g{v) = 4>{9{w)y, ct>{z) := log i_p i_p , 

where 

p:=e-'/ie' + iq-l)e-'). (6.1) 
Taking a Taylor expansion to the second order gives 

'^(^) ={P- ^) ^ + 2(gll)2 [P(g - - (« - 1) + (1 - P)]^' + 0{Z'). 

To see that the second derivative is positive at for g > 2, first take the g-derivative of 
the coefficient which is [q + 2p — 3](1 —p)/{2{q — 1)^). The definition of p and the fact 
that J > imply that p>l/q> l/{2{q - 1)). Since x + l/(x - 1) - 3 > on (2, oo) and 
2p> l/(q' — 1), it follows that the coefficient has a positive g-derivative for q>2^ and 
is therefore positive for all g > 2. (This also implies that for g G (2 — 5, 2) for some 5, the 
function (f) is concave (see [20] for a detailed analysis of the critical case g = 2).) 

The Taylor expansion gives ^'(0) = p — (1 — p)/(g — 1). Note that po := {<l + l)/(2g) 
satisfies po — (1 —Po)/{q ~ 1) = 1/2- The value of po is chosen to make i?!>'(0) = 1/2; 
by convexity of (f) near zero, there is an interval / := (po — e,po) such that for p E I, 
the equation (f){z) = z/2 has a positive solution, call it z{p). Take e > so small that 
Po — e > 1/q. For any 1 > p> 1/g there is a unique J > such that (6.1) holds. If p G /, 
then z{p) is a fixed point for the function 2(f) and it is easy to see by induction that under 
-|- boundary conditions on the binary tree, one will always have g{v) > z{p). Thus we 
have shown that Fi has a phase transition for any J such that p E I. 



39 



To find we examine the connection between po Siud \\ICj\\ where for the rest of the 
proof, the operator norm refers to the L°° norm on the orthogonal complement of the 
constants. Observe that 

1 — p €~'^ 

^ " " + - l)e-J ~ eJ + {q- l)e-J " l'^-^" 

by the computation in Section 4.2. Thus po is chosen to make \\JCj\\ = 1/2 and for any 
p & I, ||/Cj|| < 1/2. Fix any J so that p & I, and let T be any tree with 

2 = 6r(ri) < 6r(r) < ||x:j||-i. 

Let r' be as in Lemma 6.1 and set r2 = F'. Then there is no phase transition on F2 
for the chosen parameters, and since we have seen there is a phase transition for Fi, this 
completes the proof of Theorem 1.14. □ 
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